Mon. Not. R. Astron. Soc. 000, 000-000 (0000) 



Printed 1 February 2008 



(MN LaTeX style file v1.4)



A Bayesian Classifier for Photometric Redshifts: 
Identification of high redshift clusters 

Tadayuki Kodama^1, Eric F. Bell^2 & Richard G. Bower^2

^1 Institute of Astronomy, University of Cambridge, Madingley Road, Cambridge CB3 0HA, UK
^2 Department of Physics, University of Durham, South Road, Durham DH1 3LE, UK



5 October 1998 



ABSTRACT 

Photometric redshift classifiers provide a means of estimating galaxy redshifts from 
observations using a small number of broad-band filters. However, the accuracy with 
which redshifts can be determined is sensitive to the star formation history of the 
galaxy, for example the effects of age, metallicity and on-going star formation. We 
present a photometric classifier that explicitly takes into account the degeneracies 
implied by these variations, based on the flexible stellar population synthesis code 
of Kodama & Arimoto. The situation is encouraging since many of the variations in 
stellar populations introduce colour changes that are degenerate. We use a Bayesian 
inversion scheme to estimate the likely range of redshifts compatible with the observed 
colours. When applied to existing multi-band photometry for Abell 370, most of the 
cluster members are correctly recovered with little field contamination. The inverter 
is focussed on the recovery of a wide variety of galaxy populations in distant (z ~ 1) 
clusters from broad band colours covering the 4000 Å break. It is found that this can
be achieved with impressive accuracy (|Δz| < 0.1), allowing detailed investigation into
the evolution of cluster galaxies with little selection bias. 

Key words: galaxies: general - galaxies: evolution - galaxies: stellar content



1 INTRODUCTION 

The current trend in cosmology is to explore the properties
of galaxies at ever fainter limits. This has led to the
demonstration of the existence of a substantial galaxy population
at z > 3 (Steidel et al. 1996, Metcalfe et al. 1996, Steidel
et al. 1998), and the discovery of galaxy clusters with
z ≳ 1 (Deltorn et al. 1997, Stanford et al. 1997, Yamada et
al. 1997). These discoveries have allowed us to extend our
knowledge of the formation history of galaxies (Madau et
al. 1996, Baugh et al. 1998, Kodama et al. 1998) and the
growth of the universe's gravitational structure (Bower &
Smail 1997). However, while images that reach these depths
are now relatively commonplace, spectroscopic follow-up of
these objects is extremely time consuming even on 8m-class
telescopes. These problems are offset by the multiplex
capability of multi-object spectrographs (eg. LDSS;
Allington-Smith et al. 1994) and fibre-optic fed spectrographs
(eg. Taylor 1995), or by surveys targeted at specific redshifts using
tuneable narrow-band filters (eg. Jones & Bland-Hawthorn
1997). Nevertheless, even in the best studied deep images,
only a small fraction of the galaxies have known
spectroscopic redshifts.

Whereas spectroscopic redshifts use sharp absorption
and/or emission lines to accurately determine the rest
wavelength of the spectrum, it is also possible to exploit the
overall characteristic shape of the spectral energy distribution
(SED) to estimate the galaxy's redshift. This 'photometric
redshift' approach can be applied to broad band images
provided they have sufficiently high signal to noise and
adequately sample the important features of the SED. In
particular, the 4000 Å spectral break and the Balmer and
Lyman series limits are important features that arise in almost
all galaxy spectra. Although precise redshifts cannot be
determined by this method, estimates of (or limits on) z are
obtained.

The existing photometric redshift estimators fall into 
three main classes: empirical redshift estimators, those based 
on observed spectral energy distributions and model-based 
redshift estimators. Empirical redshift estimators (Connolly 
et al. 1995) are based on a training set of galaxies for which 
the redshifts and broad-band fluxes are known. These are
used to train an estimator, for example a multi-dimensional
polynomial fit, that predicts the redshift from the input
fluxes with minimum error. The disadvantage of this method
is that it requires a relatively large training dataset with
high quality colours and known redshifts. This makes it
difficult to apply beyond the limits of spectroscopic
surveys, although this problem might be alleviated using the
colours of distant, gravitationally lensed galaxies. However,
this method, when tested against independent but similar
data, can give impressive accuracy (σ_z ~ 0.06; Connolly et
al. 1995).

Lanzetta et al. (1996), Mobasher et al. (1996) and Saw- 
icki et al. (1997) use an approach that is based on the ob- 
served SEDs of galaxies covering a wide range of spectral 
types. Redshifts are estimated from the observed data by 
redshifting each of the templates and determining the best 
match to the observational colours. They emphasise the im- 
portance of using observed templates in order to incorporate 
the effects of dust. This is particularly important for galaxies 
in the redshift range 1 < z < 3 because the optical colours 
increasingly reflect the rest frame ultraviolet spectrum of the 
galaxy. One problem with this approach, however, is that
the spectral library does not take into account the evolution
of the galaxy stellar populations. The method can accommodate
evolution only in so far as it is equivalent to moving
galaxies between different spectral types.

Model-based approaches use stellar population synthesis
codes (eg. Bruzual & Charlot 1993) to produce model
SEDs that can then be compared to the observed data.
For example, Gwyn & Hartwick (1996) used a spread of
galaxy models from single burst stellar populations to models
with constant star formation to model present-day galaxies.
When generating redshifted model SEDs, the evolution
of the stellar population is automatically taken into account.
The redshift of the observed galaxy is determined by
minimising χ² residuals. The improved flexibility of this method
can, however, lead to greater errors in the estimated
redshifts. This arises because of colour degeneracies between
the effects of galaxy type and redshift.

In this paper, we focus more closely on the interrelation 
between star formation history and redshift estimation. As 
we have outlined, photometric redshifts can be susceptible 
to changes in the galaxy stellar population. For instance, the 
effects of age, metal abundance and on-going star formation 
are all reflected in the relative shape of the continuum, par- 
ticularly when it is convolved with the response of standard 
broad band filters. It is important that these uncertainties 
are taken into account when determining the galaxy redshift. 
We develop a method of photometric redshift determina- 
tion that explicitly takes into account the degeneracies im- 
plied by these variations. Clearly, incorporating additional 
free parameters to describe the star formation history of the 
galaxy threatens to make it impossible to extract useful red- 
shift information. However, many of the changes in colour 
caused by different star formation histories are degenerate: 
this is the familiar age-metallicity degeneracy that has long 
plagued the estimation of star formation histories in ellip- 
tical galaxies. We will show that for red galaxies, redshifts 
can be determined under only weak assumptions about the 
star formation history. At lower redshifts, the colours of blue 
(disk) galaxies become considerably harder to disentangle. 

Our approach attempts to deal with, and indeed em- 
brace, this unavoidable degeneracy in colours with variations 
in redshift and star formation history. We explicitly account 
for galaxy metallicities and star formation histories; these 
effects are in many cases degenerate with uncertainties due 
to the stellar initial mass function (IMF), recent star
formation, dust extinction and cosmology. We retain possible
degeneracies in plausible values of galaxy type and redshift
by storing a 'probability map' for each galaxy, which can be
used to estimate a range of acceptable redshifts rather than
reducing the observed data to a single 'best bet' estimate
of galaxy type and redshift. In particular, our classifier is
designed to pick out galaxy cluster members without biasing
the sample to galaxies of one particular star formation
history. Our motivation is to use this method to study the
photometric properties of z ~ 1 cluster galaxies with as little
selection bias as possible.

The structure of the paper is as follows. § 2 introduces
the stellar population synthesis code of Kodama & Arimoto
(1997). We derive colour tracks for a range of galaxy star
formation histories and outline the major uncertainties in
these tracks. This provides the framework for selecting
appropriate filter sets and the required photometric accuracy. § 3
details our Bayesian approach to the inversion problem. We
explicitly incorporate a wide variety of possible star formation
histories, and carry the resulting degeneracies through
into our redshift estimates. The role of the prior
is discussed. In § 4, we test our method with galaxies in
the Abell 370 cluster field and galaxies with known redshifts in
the Hubble Deep Field (HDF). § 5 gives an application of
the method to a simulated cluster at z = 1. A summary and
our conclusions are presented in § 6.



2 COLOUR TRACKS AS A FUNCTION OF 
STAR FORMATION HISTORY 

2.1 Model 

The evolutionary population synthesis model of Kodama 
(1997) was used to predict the photometric properties of 
evolving stellar populations. This model calculates the spec- 
tral evolution of a galaxy with an arbitrary star forma- 
tion history, taking into account the chemical evolution in a 
self-consistent way. Kodama & Arimoto (1997) applied this 
model to the elliptical galaxy populations of distant clus- 
ters. In this study, disk models with ongoing star formation 
are considered in addition to the elliptical models. We first 
describe the basic equations and parameters of this model 
and then summarise the elliptical galaxy and disk galaxy 
models. 

2.1.1 Equations and parameters 

We assume that the galactic gas is supplied from a surrounding
gas reservoir trapped in the gravitational potential of a
galaxy, and that the gas is always well mixed and uniformly
distributed within the galaxy. The star formation is described by
the following equations. The stellar IMF is given by a single
power law:

    \phi(m) = A\,m^{-x}, \quad m_l \le m \le m_u,    (1)

where m_l and m_u are the lower and upper limits of the initial stellar
mass respectively. The Salpeter (1955) IMF corresponds to
x = 1.35. The coefficient A is determined by

    \int_{m_l}^{m_u} \phi(m)\,dm = 1.    (2)
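The normalisation in equation (2) can be evaluated in closed form for a single power law. The following is a minimal sketch, not the code used in the paper; the function names are hypothetical, and the numerical integral is included only as a consistency check on the closed-form coefficient.

```python
import math


def imf_coefficient(x, m_lower, m_upper):
    """Return A such that the integral of A * m**(-x) over [m_lower, m_upper] is 1."""
    if math.isclose(x, 1.0):
        # Special case: the integrand is 1/m, whose integral is a logarithm.
        return 1.0 / math.log(m_upper / m_lower)
    p = 1.0 - x
    return p / (m_upper**p - m_lower**p)


def imf(m, x, m_lower, m_upper):
    """Evaluate phi(m) = A * m**(-x) inside the mass limits, zero outside."""
    if not (m_lower <= m <= m_upper):
        return 0.0
    return imf_coefficient(x, m_lower, m_upper) * m**(-x)


def integrate_imf(x, m_lower, m_upper, n_steps=20000):
    """Midpoint-rule integral of phi(m) dm, carried out in ln(m) for accuracy."""
    log_lo, log_hi = math.log(m_lower), math.log(m_upper)
    h = (log_hi - log_lo) / n_steps
    total = 0.0
    for i in range(n_steps):
        m = math.exp(log_lo + (i + 0.5) * h)
        total += imf(m, x, m_lower, m_upper) * m * h  # dm = m d(ln m)
    return total


if __name__ == "__main__":
    # Salpeter slope and mass limits quoted in the text.
    print(imf_coefficient(1.35, 0.1, 60.0))
    print(integrate_imf(1.35, 0.1, 60.0))
```

With the Salpeter values quoted in the text (x = 1.35, m_l = 0.1, m_u = 60 solar masses) the integral returns unity to numerical precision, which is all that equation (2) requires.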

The IMF is assumed to be time invariant. The star formation
rate (SFR) ψ(t) is assumed to be proportional to the gas
mass M_g(t) (Schmidt 1959):

    \psi(t) = \frac{1}{\tau} M_g(t),    (3)

where τ is the star formation time scale in Gyr. Note that
this formulation gives an exponentially decaying SFR with
an effective time scale τ/α in the case of the simple models,
where α is the so-called locked-up mass fraction defined by
Tinsley (1980). The Salpeter mass function (x = 1.35) with
m_l = 0.1 and m_u = 60 M⊙ gives α = 0.72. The gas infall rate
ξ_in(t) depends on the initial total mass of the gas reservoir
M_T and the gas infall time scale τ_in:

    \xi_{\rm in}(t) = \frac{M_T}{\tau_{\rm in}} \exp\left(-\frac{t}{\tau_{\rm in}}\right)    (4)

(cf. Köppen & Arimoto 1990). The gas metallicity Z_g(t) is
calculated numerically, using the basic equations of chemical
evolution (Tinsley 1980) and stellar nucleosynthesis tables
(Nomoto 1993). The metal contribution from SNIa is also
considered by fixing their lifetime at 1.5 Gyr. We assume
that the metal-enriched gas spreads through the galaxy
instantaneously and evenly (the one-zone approximation). As
the initial conditions, we assume that there is no gas in a
galaxy before the onset of star formation; ie. M_g(0) = 0 and
Z_g(0) = 0. Using the infall history defined as above, our
expression for the star formation rate ψ(t) and the metallicity
of the stars Z(t) = Z_g(t), the integrated spectrum
of a galaxy can be synthesised as a function of time. By
specifying the galaxy age, or equivalently its formation
redshift, and cosmological parameters, we obtain the spectra
and therefore colour indices of the galaxy as a function of
redshift. The cosmological parameters are set to H_0 = 50
km s^-1 Mpc^-1, Ω_0 = 1.0, and Λ_0 = 0.0 unless otherwise
stated.
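To make the interplay of equations (3) and (4) concrete, the sketch below integrates a gas mass budget of the form dM_g/dt = -α ψ(t) + ξ_in(t) with M_g(0) = 0. This budget is an assumption made for illustration (instantaneous recycling with a constant locked-up fraction α); the paper computes the chemical evolution numerically with full nucleosynthesis tables, which is not reproduced here. All function names are hypothetical.

```python
import math

ALPHA = 0.72  # locked-up mass fraction quoted in the text for the Salpeter IMF


def infall_rate(t, m_total, tau_in):
    """Equation (4): exponentially declining gas infall rate."""
    return (m_total / tau_in) * math.exp(-t / tau_in)


def star_formation_rate(gas_mass, tau):
    """Equation (3): SFR proportional to the gas mass."""
    return gas_mass / tau


def gas_mass_analytic(t, m_total, tau, tau_in, alpha=ALPHA):
    """Closed-form solution of dMg/dt = -alpha*Mg/tau + infall, with Mg(0) = 0."""
    k = alpha / tau      # gas consumption rate
    b = 1.0 / tau_in     # infall decay rate
    a = m_total / tau_in
    if math.isclose(k, b):
        return a * t * math.exp(-k * t)
    return a * (math.exp(-b * t) - math.exp(-k * t)) / (k - b)


def gas_mass_numeric(t_end, m_total, tau, tau_in, alpha=ALPHA, n_steps=2000):
    """Fourth-order Runge-Kutta integration of the same (assumed) gas budget."""
    def rhs(t, mg):
        return -alpha * star_formation_rate(mg, tau) + infall_rate(t, m_total, tau_in)

    h = t_end / n_steps
    t, mg = 0.0, 0.0
    for _ in range(n_steps):
        k1 = rhs(t, mg)
        k2 = rhs(t + h / 2, mg + h * k1 / 2)
        k3 = rhs(t + h / 2, mg + h * k2 / 2)
        k4 = rhs(t + h, mg + h * k3)
        mg += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        t += h
    return mg


if __name__ == "__main__":
    # Disk-like time scales (tau = tau_in = 5 Gyr) at an age of 12 Gyr,
    # with the reservoir mass normalised to unity.
    mg = gas_mass_numeric(12.0, 1.0, 5.0, 5.0)
    print(mg, star_formation_rate(mg, 5.0))
```

When infall is effectively instantaneous (τ_in much shorter than τ/α) the solution tends to a pure exponential decline with time scale τ/α, which is the behaviour the text describes for the simple models.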

2.1.2 Elliptical galaxies and bulges 

For elliptical galaxy models (E models), we use x = 1.10,
m_l = 0.1 M⊙, and m_u = 60 M⊙ for the IMF and short time
scales of star formation and gas infall: τ = τ_in = 0.1 Gyr.
The slope of the IMF differs from the Salpeter value x = 1.35
to allow the colours of the reddest giant ellipticals to be
reproduced in the context of this model (with a Salpeter IMF,
sufficiently high metallicities cannot be achieved). In addition,
in order to reproduce the observed present-day dependence
of elliptical galaxy colour on luminosity, it is useful to
introduce another parameter, the galactic wind epoch t_gw. At
this time, the energy put into the ISM of the proto-elliptical
galaxy by SNe is large enough to overcome the potential
of the galaxy, resulting in the ejection of the gas from the
galaxy and ending star formation. We constructed a model
sequence of elliptical galaxies as a function of total luminosity
by simply changing t_gw so that they reproduce the
colour-magnitude (C-M) relation of Coma ellipticals in V − K and
U − V (Bower, Lucey, & Ellis 1992a,b) at the galaxy age
T_G = 12 Gyr. Changing t_gw is equivalent to adjusting the
mean stellar metallicity of the galaxies; we therefore call this
the metallicity sequence of elliptical galaxy models. In this
model, the mean stellar metallicity ⟨[M/H]⟩ changes from
0.06 to −0.52 over a six magnitude range from the brightest
E model (M_V = −23 mag at z = 0). The time until
the onset of a galactic wind t_gw is always shorter than ~0.5
Gyr, thus the star formation in elliptical galaxies is burst-like.
The above model sequence is shown to reproduce the
evolution of the C-M relation of elliptical galaxies in distant
clusters in Kodama & Arimoto (1997) and Kodama et al.
(1998).

Figure 1. Galaxy disks. Gas mass per unit blue luminosity is
plotted against B − V colours. The filled circles show the estimated
disk colours of galaxies ranging between Sa and Im (see text).
The solid line and the four dashed lines represent the evolutionary
models with different τ and τ_in. The age T_G is changed from 1
Gyr to 15 Gyr as indicated by crosses along the lines.

To represent the photometric properties of disk galaxy 
bulges, we borrow the elliptical galaxy models. Observa- 
tional support for this includes the results of Mg2 index 
analysis (Jablonka, Martin, & Arimoto 1996). 

2.1.3 Disks 

For the disk component, the IMF parameters are set to x =
1.35, m_l = 0.1, m_u = 60 M⊙, and longer time scales of star
formation and gas infall are adopted: τ = τ_in = 5 Gyr. The age of a
galactic disk is fixed at 12 Gyr. The disk model time scales
are chosen to reproduce the integrated B − V colours and
M_g/L_B ratio of observed disks of various Hubble types as
shown in Fig. 1 (cf. Shimasaku & Fukugita 1997).

The B − V colours of disks shown in Fig. 1 as a function
of Hubble type are estimated from:

• the mean total B − V colours as a function of Hubble
type (Buta et al. 1994), and

• subtraction of the bulge light by assuming a bulge
colour B − V = 1.0 and a bulge to total light ratio (B/T) in
the B-band (Simien & de Vaucouleurs 1986).
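The bulge subtraction described in the two steps above amounts to simple flux arithmetic. The sketch below is an illustration under stated assumptions rather than the authors' procedure: fluxes are expressed in units where a source with B − V = 0 has equal B and V flux, and the function name is hypothetical. The example values (B/T = 0.41, total B − V = 0.74 for an Sa galaxy) are taken from Table 1.

```python
import math


def disk_colour(total_bv, bulge_to_total_b, bulge_bv=1.0):
    """Estimate the disk B-V colour after removing the bulge light.

    total_bv          -- integrated B-V colour of the whole galaxy (mag)
    bulge_to_total_b  -- bulge-to-total light ratio in the B band
    bulge_bv          -- assumed bulge colour (1.0 mag in the text)
    """
    if not 0.0 <= bulge_to_total_b < 1.0:
        raise ValueError("B/T must lie in [0, 1) for a disk to be present")

    # Work in relative fluxes with the total B-band flux set to 1.
    total_b = 1.0
    total_v = total_b * 10 ** (0.4 * total_bv)

    bulge_b = bulge_to_total_b * total_b
    bulge_v = bulge_b * 10 ** (0.4 * bulge_bv)

    disk_b = total_b - bulge_b
    disk_v = total_v - bulge_v
    if disk_v <= 0.0:
        raise ValueError("assumed bulge colour leaves no V-band disk light")

    return 2.5 * math.log10(disk_v / disk_b)


if __name__ == "__main__":
    # Sa galaxy: B/T and RC3 total colour as listed in Table 1.
    print(round(disk_colour(0.74, 0.41), 3))
```

Because the assumed bulge is redder than the integrated light, the recovered disk colour is always bluer than the total colour, and the correction grows with B/T.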

The total gas masses normalised by B-band disk luminosity
(M_g/L_B) as a function of Hubble type are estimated from:

• the mass of neutral atomic gas, calculated from the
integrated hydrogen index HI (Buta et al. 1994) and a conversion
formula in the Third Reference Catalogue of Bright Galaxies
(RC3) given by de Vaucouleurs et al. (1991),

• the ratio of molecular to atomic gas H2/HI (Young &
Knezek 1989),

• a helium abundance correction of 25%, and

• subtraction of the bulge light contribution to L_B.

Table 1. Integrated colours of spiral galaxies.

                   de Jong (1996)          RC3                     Model
Hubble type  B/T   B-V   V-R   V-I   V-K   U-V   B-V   V-R   V-I   U-V   B-V   V-R   V-I   V-K
Sa           0.41  0.81  0.54  1.02  2.94  0.96  0.74  0.50  1.14  0.85  0.76  0.53  1.17  3.02
Sb           0.24  0.74  0.46  1.04  2.79  0.66  0.61  0.46  0.93  0.63  0.66  0.49  1.07  2.84
Sc           0.09  0.67  0.53  1.08  2.84  0.45  0.53  0.41  0.87  0.45  0.57  0.43  0.95  2.61
Sd           0.02  0.59  0.47  1.02  2.59  0.37  0.50  0.39  0.83  0.36  0.52  0.41  0.89  2.47
Sm           0.00  0.69  0.41  0.75  2.30  0.33  0.50  0.36  0.72  0.33  0.50  0.40  0.87  2.42

Disk properties in Fig. 1 are well reproduced by the τ =
τ_in = 5 Gyr model with an age T_G = 5-15 Gyr,
irrespective of the Hubble type. The model also reproduces the
age-metallicity relation and the [O/Fe] vs. [Fe/H] diagram of
the stars in our own galaxy (Kodama 1997). The constraint
on the time scales τ and τ_in is weak because of the large
observational errors, and the permitted range could be from
2 to 8 Gyr (Fig. 1). However, as will be shown in the next
section (§ 2.2), this uncertainty will not cause problems for
the purposes of redshift determination because star forming
timescale and B/T variations have degenerate effects.

As an additional check of the validity of our models, the 
integrated colours of disk galaxies of different Hubble type 
are compared in Table 1. The observational data are mean 
Hubble type colours taken from de Jong (1996) and the RC3
(Buta et al. 1994; Buta & Williams 1995). Note that each
galaxy type has large intrinsic colour dispersion, typically
as much as 0.05-0.2 mag in optical colours and 0.2-0.4
mag in V − K. The data are compared to the model with
appropriate B-band B/T ratio (Simien & de Vaucouleurs
1986). It is clear that the detailed trends of local galaxy
colour with B-band B/T ratio are well reproduced by our
models. 

2.2 Colour tracks 

Following the models introduced above, we simulate the
colour evolution of galaxies as a function of redshift for a
variety of star formation histories.

Figure 3. The galaxy spectra at z = 0 (upper panel). The flux is
normalised at 4030 Å. The thick solid line shows a giant elliptical
E1 spectral template (Bica 1988) extended into the UV by the
attachment of IUE spectra (Arimoto 1996). Three model spectra are
superposed: a high mean metallicity model with ⟨[M/H]⟩ = 0.06
(thin solid line), a lower metallicity model with ⟨[M/H]⟩ = −0.52
(dotted line), and a high metallicity bulge plus disk model with
B/T=0.5 (dashed line). See text for details of the models. The
lower panel shows the normalised response functions of standard
Johnson-Cousins B, V and R filters, blueshifted to correspond to
those at z = 0.3.

The two solid curves in Fig. 2 show the colour evolution
in the observer's frame for an E model with a high metallicity
(⟨[M/H]⟩ = 0.06), and a model which contains a 50% contribution
of disk light in the B-band at z = 0 (see below). The
redshift is changed from 0 to 2 in steps of 0.05 as indicated
by the dots along the lines. Four different colour-colour plots
are shown, to cover a wide range in redshift, demonstrating
that the most useful passbands for photometric redshift
determination up to redshifts of ~ 1.5 typically bracket the
4000 Å break: 0.25 < z < 0.4 for B − V vs. V − R colours,
0.5 < z < 0.8 for V − R vs. R − I colours, 0.9 < z < 1.15
for R − I vs. I − K colours, and 1.0 < z < 1.5 for R − Z
vs. Z − J colours. In the above redshift ranges, the middle
band of each combination is passing through the 4000 Å
break, the most prominent spectral feature in the optical
region, which plays an important role in redshift estimation.
The horizontal colours redden rapidly with redshift while
vertical colours stay nearly constant. At around z = 0.3 for
example, as shown in Fig. 3, the V band is just on the 4000 Å
break and the B and V bands are losing flux rapidly, while the
R band flux is approximately constant as the redshift
increases. As a result V − R gets redder while B − V remains
almost constant. On the other hand, B − V is more sensitive
to changes in stellar population than V − R. Therefore,
we can see that the effects caused by changes in redshift
are almost perpendicular to those caused by changes in star
formation history.
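The redshift windows quoted for each filter triplet can be checked against the observed wavelength of the break, 4000(1 + z) Å. The sketch below is illustrative only: the effective wavelengths are rough, assumed values for the Johnson-Cousins system plus Z, not numbers from the paper, and the function names are hypothetical. The redshift windows are those stated in the text.

```python
BREAK_REST_ANGSTROM = 4000.0

# Rough effective wavelengths in Angstrom. These are illustrative
# assumptions, not values taken from the paper.
EFFECTIVE_WAVELENGTH = {
    "B": 4400.0, "V": 5500.0, "R": 6500.0,
    "I": 8000.0, "Z": 9000.0, "J": 12500.0, "K": 22000.0,
}

# Redshift windows quoted in the text, keyed by the middle band of each
# colour-colour combination (BVR, VRI, RIK, RZJ).
QUOTED_WINDOWS = {
    "V": (0.25, 0.40),
    "R": (0.50, 0.80),
    "I": (0.90, 1.15),
    "Z": (1.00, 1.50),
}


def observed_break(z):
    """Observed wavelength (Angstrom) of the rest-frame 4000 A break."""
    return BREAK_REST_ANGSTROM * (1.0 + z)


def redshift_of_break_in_band(band):
    """Redshift at which the break sits at the band's effective wavelength."""
    return EFFECTIVE_WAVELENGTH[band] / BREAK_REST_ANGSTROM - 1.0


if __name__ == "__main__":
    for band, (z_lo, z_hi) in QUOTED_WINDOWS.items():
        z_mid = redshift_of_break_in_band(band)
        inside = z_lo <= z_mid <= z_hi
        print(f"{band}: break centred at z = {z_mid:.3f}, "
              f"quoted window {z_lo}-{z_hi}, inside = {inside}")
```

Under these assumed wavelengths, the redshift at which the break crosses each middle band falls inside the corresponding quoted window, consistent with the statement that the middle band of each combination passes through the break.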

Figure 2. Colour-colour plots in the observer's frame. The two solid curves show the colour evolution of an old (T_G = 12 Gyr, or
z_f = 4.5), high metallicity (⟨[M/H]⟩ = 0.06) E model (redder), and the B/T=0.5 model (bluer) in which the disk contributes half of the
B-band light at z = 0. The redshift changes from 0 to 2 in steps of 0.05, indicated by dots along the tracks. The six arrows shown from three
redshift points along the track indicate the colour changes due to several different effects. See text for details.

In Fig. 2, six possible effects which change the colour
evolution of a galaxy with redshift are considered. The six
arrows (indicated at three redshift points along the colour
track) show the change in colour of an old, high metallicity
elliptical model (T_G = 12 Gyr and ⟨[M/H]⟩ = 0.06) caused by
the following effects.

(i) metallicity - lower thick solid arrows: The mean
stellar metallicity of the E models is changed from
⟨[M/H]⟩ = 0.06 to −0.52. The galaxy age is fixed at 12 Gyr,
corresponding to a formation redshift z_f = 4.5.

(ii) age - dashed arrows: The formation redshift of the E
model is varied from z_f = 4.5 to 1.0 (4.5 to 2.0 for the RIK
and RZJ diagrams), corresponding to galaxy ages T_G =
12.0 to 8.4 Gyr (12.0 to 10.5 Gyr). Metallicity is fixed at
⟨[M/H]⟩ = 0.06.

(iii) disk component - long dashed arrows: As outlined
earlier, the models deal with disk galaxies by adding a star
forming disk component onto an E model, representing the
galaxy bulge. The B-band B/T ratio at z = 0 is changed
from 100% to 0%. Note that B/T=0.41, 0.24 and 0.09
correspond to the Hubble types Sa, Sb and Sc, respectively
(Simien & de Vaucouleurs 1986). Note also that the B/T
ratio is a rising function of redshift, since the bulge gets
brighter with redshift while the disk brightness remains
roughly constant up to z = 1. For example, a B/T ratio
of 0.5 at z = 0 actually increases to 0.8 at z = 1. We have
also modelled the B/T sequence using [M/H]_disk = −1.3
and [M/H]_disk = 0 by forcing all the disk stars to have the
same metallicity instead of following the chemical evolution
of the disk. [M/H]_disk is found to have a negligible effect on
the colour tracks of the B/T sequence, and is not considered
hereafter. The time scales τ and τ_in can also be changed, but
this is found to be similar in effect to changes in B/T ratio at
fixed τ and τ_in. This is because the colour of the B/T
sequence at a given redshift is essentially determined by the
ratio of the current star formation rate to the total mass of
the galaxy. This parameter can be adjusted either by changing
the time scales of the disk or by changing the B/T ratio.
Thus the only effect on the colour-colour diagrams of changing
the timescales is to shorten or extend the vectors of the
B/T sequence for a given B/T ratio.

(iv) recent star burst - dotted arrows: The possible
effects of recent large-scale star formation are considered by
the addition of a recent star burst to the E model. The
burst population is assumed to be a simple stellar population
(SSP) with solar metallicity. The arrow denotes the
changes caused by a T_b = 0.5 Gyr old burst population
corresponding to 10% of the total stellar mass (f_b = 0.1) at that
redshift. The direction of the vector on the colour-colour
diagrams depends on T_b, but unless T_b is around 0.5 (±0.3)
Gyr, the burst sequence follows either the B/T sequence or
the age sequence closely. Main-sequence turn-off stars in the
burst population with age ~ 0.5 Gyr have an effective
temperature of about 10000 K and contribute significantly to
the total flux at rest wavelengths of 3000-4000 Å, creating
aberrant colour changes in the colour-colour plots.

(v) reddening - dash-dotted arrows: The extinction effect
due to internal dust is estimated by using the extinction
curve given by Mathis (1990). The full arrows correspond to
A_V = 0.5 mag.

(vi) cosmology - upper thick solid arrows: The colour
tracks have a weak dependence on the adopted cosmology.
Other sets of cosmological parameters are tested; ie. H_0 =
65, Ω_0 = 0.1, Λ_0 = 0.0, and H_0 = 80, Ω_0 = 0.2, Λ_0 = 0.8.
The formation redshift z_f is fixed to 4.5 in all cases. The
full arrows show the colour change for the latter cosmology.
The colour change from the former cosmology is smaller and
along a similar vector.
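The correspondence between formation redshift and galaxy age used in items (i) and (ii) follows from the default cosmology (H_0 = 50 km s^-1 Mpc^-1, Ω_0 = 1, Λ_0 = 0), for which cosmic time has a closed form. The sketch below covers that Einstein-de Sitter case only; it is not valid for the alternative cosmologies of item (vi), and the function names are hypothetical.

```python
# Hubble time for H0 = 1 km/s/Mpc expressed in Gyr (approximately 977.8 Gyr).
HUBBLE_TIME_GYR_AT_H0_1 = 977.8


def cosmic_time_eds(z, h0=50.0):
    """Age of an Einstein-de Sitter universe (Omega_0 = 1, Lambda_0 = 0) at redshift z, in Gyr."""
    hubble_time = HUBBLE_TIME_GYR_AT_H0_1 / h0
    return (2.0 / 3.0) * hubble_time * (1.0 + z) ** -1.5


def galaxy_age(z_form, z_obs=0.0, h0=50.0):
    """Time elapsed between formation at z_form and observation at z_obs, in Gyr."""
    if z_form < z_obs:
        raise ValueError("formation redshift must not be below the observed redshift")
    return cosmic_time_eds(z_obs, h0) - cosmic_time_eds(z_form, h0)


if __name__ == "__main__":
    for z_form in (4.5, 2.0, 1.0):
        print(f"z_f = {z_form}: present-day age = {galaxy_age(z_form):.1f} Gyr")
```

The same function gives the age of a model galaxy at any observed redshift, for example `galaxy_age(4.5, z_obs=1.0)`, which is the quantity needed when a colour track is followed to high redshift.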

The age and metallicity sequences (i) and (ii) are almost
indistinguishable in all the colour-colour plots for ages
≳ 1 Gyr. This reflects the age-metallicity degeneracy inherent
in old stellar populations (Worthey 1994). However,
this degeneracy actually improves the prospects for the
determination of photometric redshifts, as the effects of age
and metallicity are quite distinct, given the right choice of
passbands, from the effects of changing galaxy redshift. In
addition, it is clear from Fig. 2 that changes in assumed
cosmology and interstellar reddening also have colour effects
similar to age and metallicity, with an opposite sense. As a
result, E-type galaxies at a given redshift should populate
a restricted area on the colour-colour diagram (almost a
single line) characteristic of that redshift, irrespective of their
stellar populations, regardless of interstellar reddening, and
whichever cosmology is assumed. This means that it is
possible to assign redshifts to old stellar populations without
prior knowledge of galaxy properties.

However, the colour changes caused by the B/T sequence
(iii) are not entirely degenerate with those due to
age and metallicity on the RIK and RZJ diagrams (Fig. 2),
due to the presence of on-going star formation. This on-going
star formation causes a bluer colour, eg. R − I, for a given
I − K colour than do the effects of age and metallicity
for z ~ 1 galaxies. Recent large bursts of star formation of
age ~ 0.5 Gyr (iv) also lead to effects distinct from those of
age and metallicity, and those of changing the B/T ratio.

This can lead to considerable uncertainty in the esti- 
mation of galaxy redshift, as a given set of colours, on the 
basis of the colour-colour plots presented in Fig. 2, will be 
consistent with a wide range of redshifts, depending on how 
the colours are explained by our model; eg. by changing the 
B/T ratio, or the metallicity of the galaxy template. How- 
ever, if a passband with a short rest-frame wavelength is 
used, it is possible to discriminate the presence of young stel- 
lar populations photometrically, leading to a less ambiguous 
determination of redshift, and some information on the star 
formation history of the galaxy. This is illustrated in the up- 
per half of Table 2 (see also the lower left panel of Fig. 2), 
where we compare 3 galaxy templates with very similar red 
colours (R − I ~ 1.35, I − K ~ 3.48) which present very
distinct colours in bluer passbands, allowing relatively easy
discrimination between these possibilities. Another degeneracy
apparent in Fig. 2 in redder passbands is that between high 
redshift, low B/T ratio galaxies, and lower redshift early- 
type galaxies. This, again, is illustrated in the lower half of 
Table 2 where we again see that bluer passbands allow easy 
splitting of this degeneracy. 
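The point made with Table 2 can be quantified by comparing the spread of each colour within a group of degenerate templates. The sketch below simply tabulates the colours listed in Table 2 (templates are labelled by redshift only) and computes the maximum minus minimum value per colour; the function name is hypothetical.

```python
# Colours copied from Table 2; each group contains templates that are
# nearly indistinguishable in the red colours R-I and I-K.
UPPER_GROUP = {
    "z=0.9": {"B-R": 1.32, "V-R": 0.86, "R-I": 1.32, "I-K": 3.50},
    "z=1.0": {"B-R": 2.57, "V-R": 1.32, "R-I": 1.36, "I-K": 3.49},
    "z=1.1": {"B-R": 1.64, "V-R": 1.09, "R-I": 1.38, "I-K": 3.46},
}
LOWER_GROUP = {
    "z=0.5": {"B-R": 2.98, "V-R": 1.37, "R-I": 1.13, "I-K": 3.10},
    "z=0.7": {"B-R": 1.48, "V-R": 0.89, "R-I": 1.15, "I-K": 2.98},
    "z=0.9": {"B-R": 0.80, "V-R": 0.57, "R-I": 1.09, "I-K": 3.00},
}


def colour_spread(group, colour):
    """Return max - min of one colour across the templates of a group (mag)."""
    values = [template[colour] for template in group.values()]
    return max(values) - min(values)


if __name__ == "__main__":
    for name, group in (("upper", UPPER_GROUP), ("lower", LOWER_GROUP)):
        for colour in ("R-I", "I-K", "V-R", "B-R"):
            print(f"{name} group, {colour}: spread = {colour_spread(group, colour):.2f} mag")
```

In both groups the red colours differ by roughly 0.1 mag or less, while B − R differs by more than a magnitude, which is why a single blue passband is enough to separate the templates.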

Another point to note from Fig. 2 is that when a par- 
ticular colour pair is selected to allow the accurate estima- 
tion of redshifts within a certain redshift range, this colour 
pair also provides a means of rejecting galaxies (particularly 
higher B/T objects) that lie outside this optimal redshift 
range (although the estimated redshifts will obviously be 
much less accurate for these objects). Problems will occur 
for much higher redshift objects, and objects with a small 
B/T ratio, as discussed above. 

Despite the demonstrated utility of the bluer passbands
in 'breaking' degeneracies between galaxies which look identical
in red passbands, we aim to use little of the colour information
shortwards of 2500 Å. Primarily, this is because the
model spectra are ill-constrained at short UV wavelengths
in both elliptical and star forming galaxies because of the
effects of the UV-upturn (an anomalous rise in flux towards
short UV wavelengths, observed in nearby giant ellipticals;
eg., Burstein et al. 1988) and the uncertain effects of dust
extinction (White, Keel, & Conselice 1996). The source of
the UV-upturn is still poorly understood, and the model
predictions for its source and effects are still uncertain. If
the UV-upturn comes from hot young stars, this population
is actually considered in this model by superposing only a
small fraction of on-going star formation onto the passively
evolving ellipticals. If, however, the source of the UV-upturn
is hot horizontal branch stars, then it is necessary to fine
tune the mass loss parameter along the red giant branch
to reproduce the UV-upturn (cf. Yi, Demarque, & Oemler
1997). Even if this were the case, such hot horizontal branch
stars would disappear at high redshift (z ≳ 1), which is our
main region of interest, since the envelope mass of a horizontal
branch star gets larger as the mass of a main sequence
turn off star gets larger with look back time.

Table 2. Colour degeneracies.

z     B/T   ⟨[M/H]⟩   f_b (%)   B-R    V-R    R-I    I-K
0.9   0.5    0.06               1.32   0.86   1.32   3.50
1.0   1.0   −0.52               2.57   1.32   1.36   3.49
1.1   1.0    0.06     20        1.64   1.09   1.38   3.46

0.5   1.0    0.06               2.98   1.37   1.13   3.10
0.7   0.4    0.06               1.48   0.89   1.15   2.98
0.9   0.2    0.06               0.80   0.57   1.09   3.00

An additional source of uncertainty in our models, especially
in the UV, is the neglect of the effects of dust extinction
on the colours of the stellar populations incorporating
on-going star formation. This would at first appear to be a
serious handicap, as disk-dominated galaxies clearly contain
significant amounts of dust, especially in the spiral arms,
where B band extinction A_B ≳ 1 mag (White, Keel, &
Conselice 1996, Berlind et al. 1997). However, by inspection of
Fig. 2, it is clear that the colour changes at rest-frame optical
wavelengths are equivalent to increasing B/T ratio, age
or metallicity, meaning that relatively large uncertainties in
dust reddening can be accommodated by compensating
changes in other galaxy parameters. This is
also demonstrated in Table 1, where it is apparent that our
models can accurately reproduce the colours of galaxies with
ongoing star formation. This situation is unlikely to hold in
the UV, however, as prescriptions for the dust extinction
law start to diverge at these short wavelengths (Calzetti,
Kinney, & Storchi-Bergmann 1994).

Both the UV-upturn and the uncertain effects of dust 
reddening in the far-UV lead us to place little confidence 
in our model UV colours. We should therefore avoid this 
spectral region if possible. 

In addition, it should be noted that we neglect line emission from star
forming galaxies, such as the commonly observed [O II], [O III] and Balmer
features seen both locally (Kennicutt 1992) and at high redshifts (Hammer et
al. 1997). This should not present a major problem, as the effect of line
emission on broad band photometry is not large: a line with an equivalent
width of 20 Å in emission would cause only ~ 0.02 mag of brightening in the
broad band magnitude.
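The size of this effect can be checked with a one-line estimate. The sketch
below is illustrative only: it assumes a flat continuum and a rectangular
filter, and the filter width of 1000 Å is an assumed round number rather than
a value taken from the text. Under those assumptions a line of equivalent
width EW raises the in-band flux by a factor 1 + EW/W.

```python
import math

def line_brightening_mag(equivalent_width, filter_width):
    """Brightening (mag) of a broad-band magnitude caused by one emission line.

    Assumes a flat continuum and a rectangular filter; both widths in Angstrom.
    """
    return 2.5 * math.log10(1.0 + equivalent_width / filter_width)

# Assumed filter width of 1000 Angstrom (illustrative, not from the paper).
delta_m = line_brightening_mag(20.0, 1000.0)
print(f"brightening = {delta_m:.3f} mag")
```

For a narrower filter the brightening grows roughly in proportion to EW/W, so
the estimate is most reliable for the broad Johnson-Cousins bands.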

Finally, we note that although most of the redshift range below 1.5 can be
covered by the standard Johnson-Cousins system including Z band, there are some
particular redshift ranges where we have larger errors in the estimated
redshifts; i.e. $z < 0.25$, $0.4 < z < 0.5$, and $0.8 < z < 0.9$. In these
redshift ranges, the effect of changing redshift on the colours is hard to
distinguish from that of changing the stellar population (Fig. 2). If we want
to handle clusters in these redshift ranges with better precision, we need to
use passbands in other photometric systems which properly bracket the 4000 Å
break at the cluster redshifts.



3 BAYESIAN CLASSIFICATION 
3.1 Basic scheme 

A Bayesian approach allows us to incorporate our existing knowledge of galaxy
populations, and thus to proportionally weight the areas of parameter space
that we search. The Bayesian probability of a particular galaxy having a
redshift $z$ and bulge to total luminosity ratio B/T is given by the equation:

$$P_{\rm Gal}(z, B/T) = P_1(z, B/T \mid m_B)\, P_2(z, B/T), \qquad (5)$$

where $P_1(z, B/T \mid m_B)$ is the probability of a given galaxy of apparent
magnitude $m_B$ having a redshift $z$ and bulge to total luminosity ratio B/T,
and $P_2(z, B/T)$ is the probability of a given model galaxy reproducing the
observed galaxy colours. We first deal with the evaluation of $P_2(z, B/T)$.

The basic philosophical approach used for this redshift estimator is to
compare a galaxy's location on a colour-colour plot with a finely-spaced grid
of models superimposed on that plot, and so to estimate the properties of that
galaxy. The magnitudes of the observed galaxy are converted into colours, and
the errors in the colours are used to make up a covariance matrix describing
the sizes of the colour errors and their relationships. Then, for each of the
model galaxies, the difference between its colours and the observed colours is
calculated. Under the assumption that the photometric errors follow a Gaussian
distribution, the probability that the model describes the galaxy colours
adequately ($P_2(z, B/T)$) is given by:

$$P_2(z, B/T) = \frac{1}{[(2\pi)^{n} |C|]^{1/2}} \exp\{-\tfrac{1}{2}(u^{T} C^{-1} u)\}, \qquad (6)$$

where $n$ is the number of colours used, $u$ is the vector of differences
between the model galaxy colours and the observed colours, $C$ is the
covariance matrix of those colours, and $|C|$ is the determinant of the
covariance matrix. The diagonal elements of $C$ correspond to the variance in
the individual colours. The off-diagonal elements correspond to the variance of
any passband in common between two colours, with the appropriate sign (which
indicates whether the errors in a given passband affect the colours in the same
or an opposite sense). Since we require galaxies to have small ($\lesssim 0.1$
mag) photometric errors in order to reliably determine their redshift, a
Gaussian rather than a log-normal is an adequate representation of the error
distribution in each band. This assumption considerably shortens the
probability computation time. If it is impossible to find a satisfactory match
to the galaxy colours, the galaxy is omitted from further consideration.

Table 3. Adopted LF parameters.

  Type     B/T     M*_B      α        φ* (×10^-3 Mpc^-3)
  E        1.0     -20.74    -0.85    0.188
  S0       0.6     -20.25    -0.94    0.950
  Sa-Sb    0.33    -20.23    -0.58    1.088
  Sc-Im    0.0     -20.32    -1.07    0.625
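The colour term can be sketched in a few lines of Python. This is a minimal
illustration, not the authors' code: the function names, the band list and the
example magnitudes and errors are all assumptions made for the sketch. The
covariance matrix is built from independent per-band magnitude errors, so that
two colours sharing a passband acquire an off-diagonal term equal to plus or
minus the variance of that band, as described above.

```python
import math

def colour_covariance(bands, colours, mag_err):
    """Covariance matrix of colours built from independent magnitude errors.

    bands   : list of band names, e.g. ["V", "R", "I"]
    colours : list of (blue, red) band pairs, colour = blue - red
    mag_err : dict of 1-sigma magnitude errors per band
    """
    # Row i of `a` writes colour i as a linear combination of the magnitudes.
    a = [[(b == blue) - (b == red) for b in bands] for blue, red in colours]
    n = len(colours)
    return [[sum(a[i][k] * a[j][k] * mag_err[bands[k]] ** 2
                 for k in range(len(bands)))
             for j in range(n)] for i in range(n)]

def solve_and_det(c, u):
    """Return (C^-1 u, |C|) by Gaussian elimination with partial pivoting."""
    n = len(u)
    m = [list(row) + [u[i]] for i, row in enumerate(c)]
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if m[pivot][col] == 0.0:
            raise ValueError("singular covariance matrix")
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= f * m[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][k] * x[k] for k in range(i + 1, n))) / m[i][i]
    return x, det

def colour_likelihood(model_colours, observed_colours, cov):
    """Eq. (6): Gaussian probability that the model reproduces the colours."""
    u = [mod - obs for mod, obs in zip(model_colours, observed_colours)]
    x, det = solve_and_det(cov, u)
    chi2 = sum(ui * xi for ui, xi in zip(u, x))
    n = len(u)
    return math.exp(-0.5 * chi2) / math.sqrt((2.0 * math.pi) ** n * det)

# Illustrative values only: V-R and R-I colours with assumed magnitude errors.
bands = ["V", "R", "I"]
colours = [("V", "R"), ("R", "I")]
cov = colour_covariance(bands, colours, {"V": 0.05, "R": 0.04, "I": 0.06})
p2 = colour_likelihood([0.86, 1.32], [0.90, 1.30], cov)
```

Here the shared R band enters V-R and R-I with opposite signs, so the
off-diagonal element is negative. Evaluating `colour_likelihood` over a grid of
model colours gives the raw colour-only probability map.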

This procedure gives the probability that an observed galaxy is adequately
described by a given model galaxy. This probability is evaluated for a large
number of plausible model galaxies with a range of B/T ratios and redshifts.
Other effects, such as the age and metallicity of the model galaxies, are
considered later in § 3.3. These evaluations make up a 'probability map' on
the plane of B/T ratio and redshift. In order to obtain the final probability
$P_{\rm Gal}(z, B/T)$ of the galaxy having a particular $z$ and B/T, it is
necessary to construct a prior distribution, given what is already known about
galaxy populations.



3.2 Prior distribution 

The quantity $P_1(z, B/T \mid m_B)$ is our prior distribution. The effect of
the prior is to modulate the redshift estimates provided by the colour analysis
by using magnitude information. Note that it makes little difference to the
redshift estimate (well within the error bars of the redshift determination)
unless there is a degeneracy, in which the colours of two or more models
satisfy the observational constraints equally well at different redshifts. In
that case, the prior is designed to discover which one of those options is more
likely to be observed, and it weights the 'probability map' accordingly.

In forming the prior distribution, we need to know the type dependent
luminosity function (LF) $\Phi(m_B, z, B/T)$ and the volume element
$dV/dz\,d\Omega$. Using these two elements, the prior is given as follows:

$$P_1(z, B/T \mid m_B) = \frac{dV}{dz\,d\Omega}\,\Phi(m_B, z, B/T). \qquad (7)$$

These two parts are treated separately below.

3.2.1 The local luminosity function

In order to get $\Phi(m_B, z, B/T)$ we use the local, type-dependent
luminosity function (LF). However, there remains considerable uncertainty in
the type-dependent LF, as the splitting into morphological types is carried out
in a number of different ways, and the faint end slopes differ considerably
between different studies (Marzke et al. 1998; Bromley et al. 1997; Marzke et
al. 1994; Binggeli, Sandage, & Tammann 1988). We chose to adopt a variant of
the determination by Marzke et al. (1994) in the Schechter (1976) form, which
is parameterised by $\alpha$ and $M_B^{*}$. The parameters for the observed
local luminosity function are summarised in Table 3 as a function of the Hubble
type. To connect the Hubble types with the B/T ratio, we use Simien & de
Vaucouleurs (1986). The characteristic magnitude of the LF, $M_B^{*}(B/T)$,
corresponds to the apparent magnitude $m_B^{*}(z, B/T)$ at a redshift $z$ as:

$$m_B^{*}(z, B/T) = M_B^{*}(B/T) + DM(z) + \Delta M_B(z, B/T), \qquad (8)$$

where $DM$ is the distance modulus at redshift $z$ in the adopted cosmology.
$\Delta M_B$ is the absolute magnitude change in the B-band in the observer's
frame due to the luminosity evolution and the shift of the sampled rest-frame
wavelength shortwards with redshift, and is taken from the model. In this way,
we finally obtain the LF in apparent magnitude $m_B$ as a function of redshift
and B/T ratio:

$$\Phi(m_B, z, B/T) = 0.92\,\phi^{*} \exp\{-0.92(\alpha+1)(m_B - m_B^{*}) - \exp[-0.92(m_B - m_B^{*})]\}. \qquad (9)$$

If the observed galaxy lacks B-band data, we use a prior in the band nearest
to B. In this case, we make up the local LFs in the alternative band by
shifting the B-band LFs using the model colours of each type.
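Eqs. (8) and (9) translate directly into code. The sketch below is a hedged
illustration: the function names are hypothetical, and the distance modulus and
evolutionary correction passed in the example are placeholder numbers, since
the real values come from the adopted cosmology and the population synthesis
model. The factor 0.92 in Eq. (9) is taken to be 0.4 ln 10, which is what makes
the exponential form equal to the usual base-10 Schechter function.

```python
import math

LN10_04 = 0.4 * math.log(10.0)  # ~0.92, the numerical factor in Eq. (9)

def apparent_mstar(abs_mstar, distance_modulus, delta_m):
    """Eq. (8): characteristic apparent magnitude at a given redshift."""
    return abs_mstar + distance_modulus + delta_m

def schechter_apparent(m, m_star, alpha, phi_star):
    """Eq. (9): Schechter LF per unit apparent magnitude."""
    x = LN10_04 * (m - m_star)
    return LN10_04 * phi_star * math.exp(-(alpha + 1.0) * x - math.exp(-x))

# E-type parameters from Table 3; DM = 42.0 and Delta M = 1.0 are placeholders.
m_star_E = apparent_mstar(-20.74, 42.0, 1.0)
phi_at_mstar = schechter_apparent(m_star_E, m_star_E, -0.85, 0.188e-3)
phi_faint = schechter_apparent(m_star_E + 2.0, m_star_E, -0.85, 0.188e-3)
```

At $m_B = m_B^{*}$ the function reduces to $0.92\,\phi^{*}/e$, which provides a
quick sanity check on any implementation.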



3.2.2 Volume element 

The other essential ingredient of the prior is the volume element
$dV/dz\,d\Omega$. The formula for the volume element as a function of redshift
was taken from Carroll, Press, & Turner (1992), with the addition of some
factors of $c$ to satisfy dimensionality considerations. It allows variations
in $\Omega_0$ and $H_0$, and the inclusion of the cosmological constant via the
term $\Lambda_0$:

$$\frac{dV}{dz\,d\Omega} = \frac{d_M^{2}}{[1 + \Omega_k (H_0 d_M/c)^{2}]^{1/2}}\,\frac{d(d_M)}{dz}, \qquad (10)$$

where $\Omega_k$ is given by $-kc^{2}/(R_0^{2} H_0^{2})$, $R_0$ is the scale
factor of the universe and $k$ is the curvature of the universe. The quantity
$d_M$ is the proper motion distance, and in this case is given by:

$$d_M(z) = \frac{c}{H_0 |\Omega_k|^{1/2}}\,{\rm sinn}\left(|\Omega_k|^{1/2} \mathcal{J}\right), \qquad (11)$$

where 'sinn' is a function that equals sinh in an open universe and sin in a
closed universe, and disappears in a critical universe. The integral
$\mathcal{J}$ is given by:

$$\mathcal{J} = \int_0^{z} [(1+z')^{2}(1+\Omega_0 z') - z'(2+z')\Lambda_0]^{-1/2}\,dz', \qquad (12)$$

which must be integrated numerically for most non-trivial cosmologies.



3.2.3 Comparison with observation 

The prior was used to calculate the $n(z)$ distribution within a magnitude
range $22.5 < m_B < 24.0$. This calculated distribution was then compared to
the observed redshift distribution of galaxies within the same magnitude range
given by Glazebrook et al. (1995) and Cowie et al. (1996). The comparison is
shown in Fig. 4. It is clear that the prior reproduces the overall form of the
observed $n(z)$ diagram.
[Figure 4: n(z) histograms for 22.5 < B < 24.0; legend: model, Cowie et al.
(1996), Glazebrook et al. (1995); horizontal axis: redshift (z).]

Figure 4. The observed $n(z)$ distributions for a sample of galaxies in the
magnitude range $22.5 < m_B < 24.0$ from Glazebrook et al. (1995) and Cowie et
al. (1996) are plotted against the expectation for the $n(z)$ distribution
calculated from our prior distribution. The overall shape is in adequate
agreement with the observational data.




[Figure 5: three stacked panels (prior; 'Raw Probability Map'; 'Final
Probability Map'), each with Redshift on the horizontal axis.]

Figure 5. Probability maps for a given galaxy with $m_V = 20.3$. The contours
indicate linear probabilities in steps of 0.1 from the maximum. The top panel
shows the prior distribution ($P_1$), while the middle panel indicates the
probability map from the colour information only ($P_2$). The bottom panel is
the final combined probability map. The prior rejects the high redshift
solution (which is permitted when considering only the galaxy's colours) based
on the brightness of the galaxy.



3.2.4 The effect of the prior

It should be noted that the prior distribution is quite model-dependent,
because the type-dependent local LF is ill-constrained, and because the
detailed type-dependent spectral evolution is poorly understood. Also, here we
make two assumptions: that there is no number evolution of galaxies, and that
there is no size dependent luminosity evolution, i.e. that galaxies with
similar B/T ratios have the same colours at all redshifts, regardless of their
total luminosity. These assumptions and ingredients may be inadequate to
describe the real universe, especially in the context of a hierarchical
clustering universe. However, these uncertainties are not so important, because
the estimated redshift is essentially determined by the colour term ($P_2$ in
Eq. 5), and the prior ($P_1$) plays only a supplementary role. The prior
becomes important when the solution from the colour term splits into multiple
redshift ranges. In such a case, the prior works to avoid unreasonable redshift
solutions for a given apparent magnitude. This situation is illustrated in
Fig. 5, which shows an example of the probability maps of a given galaxy. The
colour term gives two solutions, one at low redshift ($z \sim 0.2$) and the
other at high redshift ($z \sim 1.0$), but the prior rejects the solution with
higher redshift based on the brightness of the galaxy.

We also tested the effect of the local LF on the final estimated redshift
through the prior by changing the LF parameters listed in Table 3. We shifted
$M_B^{*}$ by $\pm 0.5$ magnitude for all types, resulting in a shift of the
redshift peak of the $n(z)$ distribution in Fig. 4 by $\mp 0.1$, and we tried
fixing the faint end slope $\alpha$ at $-1$ for all types. In all cases,
however, the change in the final estimated redshifts was well within the
estimated redshift errors. This experiment shows that our method is robust to
uncertainties in the prior estimation.
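The way the prior resolves a two-solution case of the kind shown in Fig. 5 can
be illustrated with a toy one-dimensional example. Everything numerical here is
invented for the sketch: the Gaussian shapes, their widths and the redshift
grid are assumptions, chosen only to mimic a colour term with equally good
solutions near $z \sim 0.2$ and $z \sim 1.0$ and a prior that disfavours high
redshift for a bright galaxy.

```python
import math

def gaussian(x, mu, sigma):
    return math.exp(-0.5 * ((x - mu) / sigma) ** 2)

z_grid = [0.02 * i for i in range(1, 76)]   # 0.02 ... 1.50

# Toy colour term P2: two equally good solutions.
p2 = [gaussian(z, 0.2, 0.05) + gaussian(z, 1.0, 0.05) for z in z_grid]

# Toy prior P1 for a bright galaxy: high redshift is very unlikely.
p1 = [gaussian(z, 0.25, 0.2) for z in z_grid]

# Eq. (5): multiply the two terms, then normalise over the grid.
p_gal = [a * b for a, b in zip(p1, p2)]
norm = sum(p_gal)
p_gal = [p / norm for p in p_gal]

best_z = z_grid[max(range(len(z_grid)), key=p_gal.__getitem__)]
high_z_weight = sum(p for z, p in zip(z_grid, p_gal) if z > 0.6)
```

In this toy case almost all of the combined probability ends up in the
low-redshift peak, while a broader prior would leave both solutions in play.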



3.3 The models included in the classifier 

The estimates of redshift and galaxy type will depend quite sensitively on
the detailed choice of model galaxy template. In § 2.2, we investigated the
various effects on the galaxy colours, namely the effects of age, metallicity,
disk light addition, recent star bursts, reddening, and cosmology. However,
many of these effects were found to be degenerate with one another. In these
models, effects due to age, metallicity, reddening and changes in cosmology are
particularly degenerate. Therefore, by including only the metallicity effect
explicitly in the classifier, we also cover the other three effects at the same
time, as they behave just like metallicity variations. To this end, we
considered four B/T sequences with different bulge metallicities. Each sequence
gives a different probability map on the redshift and B/T plane, and the maps
are then combined into a single 'probability map' by taking the mean of the
separate maps. This mean is in essence a weighted mean, as the most plausible
metallicity for the model galaxies will have the best match to the colours. In
this way, we take into account the metallicity effects explicitly, and hence
the other degenerate effects.
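The combination step can be sketched as an element-wise mean over the maps
from the separate metallicity sequences. The function name and the tiny 2 x 2
maps below are illustrative assumptions. Because the maps are averaged without
being renormalised first, a sequence whose colours match poorly contributes
small values everywhere, which is the sense in which the mean is effectively
weighted.

```python
def combine_maps(maps):
    """Element-wise mean of probability maps, one per metallicity sequence.

    Each map is a list of rows; all maps must share the same (z, B/T) grid.
    """
    n = len(maps)
    rows, cols = len(maps[0]), len(maps[0][0])
    return [[sum(m[i][j] for m in maps) / n for j in range(cols)]
            for i in range(rows)]

# Two toy maps on a 2 x 2 grid: the second sequence matches the colours better.
map_metal_rich = [[0.2, 0.0], [0.0, 0.0]]
map_metal_poor = [[0.0, 0.0], [0.0, 0.8]]
combined = combine_maps([map_metal_rich, map_metal_poor])
```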

The remaining significant effect which is not covered is that of a recent
burst of star formation. As seen in Fig. 2, large amounts of relatively recent
star formation can make a galaxy look as if it has a lower redshift than it
actually does, if the colours are interpreted as being entirely due to
redshift, B/T and metallicity effects. Such galaxies might simply be assigned
significantly underestimated redshifts. However, as already shown, unless the
burst strength is as high as > 15% and the burst age is ~ 0.5 Gyr, the colour
change caused by the burst population is similar to those resulting from other
effects. These populations should be short lived and are expected to be rare.
Therefore omitting the recent star burst model will not affect our global
redshift estimation. We should note, however, that this could be a problem if a
significant fraction of cluster members were strongly affected by a recent
burst, due to cluster-cluster merging for example. In such a case, we would
need to include an extra set of recent star burst models to estimate redshifts
correctly, although it would lead to greater estimation errors.



Table 4. Passband choice for random galaxy simulations.

               all                     E/S0
  passbands    σ(Δz)     σ(error)      σ(Δz)     σ(error)
  BVRIK        0.065     0.076         0.060     0.074
  RIK          0.127     0.135         0.121     0.129
  BVRI         0.178     0.199         0.176     0.210


3.4 Error estimates 

At this stage, the redshift and galaxy type estimates are in the form of the
'probability map' $P_{\rm Gal}(z, B/T)$. However, a single estimate of a given
galaxy's redshift and type is often required, e.g. for comparison with real
redshifts, or for cluster member identification. The best estimate for the
redshift ($z_{\rm estimated}$) and the effective $1\sigma$ confidence interval
($z_{\rm estimated}^{\rm min}$, $z_{\rm estimated}^{\rm max}$) are obtained by
taking the $P_{\rm Cum}(z) = 0.5$ point and the $P_{\rm Cum}(z) = [0.16, 0.84]$
interval, respectively, of the cumulative distribution:

$$P_{\rm Cum}(z) = \int_0^{z} dz' \int_{B/T=0}^{B/T=1} d(B/T)\, P_{\rm Gal}(z', B/T). \qquad (13)$$
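On a discrete grid, Eq. (13) becomes a marginalisation over B/T followed by a
running sum in redshift. The sketch below is one possible discretisation, with
a hypothetical function name and a toy Gaussian map as input; it returns the
first grid redshift at which the cumulative probability reaches each level, so
its resolution is limited by the grid spacing.

```python
import math

def redshift_estimate(z_grid, p_map):
    """Median and 16/84 per cent redshifts from a P_Gal(z, B/T) map.

    p_map[i][j] is the probability at z_grid[i] and the j-th B/T value.
    Returns (z_estimated, z_min, z_max).
    """
    # Marginalise over B/T, then accumulate along redshift (Eq. 13).
    p_z = [sum(row) for row in p_map]
    total = sum(p_z)
    cumulative, running = [], 0.0
    for p in p_z:
        running += p / total
        cumulative.append(running)

    def z_at(level):
        # First grid redshift at which the cumulative probability reaches level.
        for z, c in zip(z_grid, cumulative):
            if c >= level:
                return z
        return z_grid[-1]

    return z_at(0.5), z_at(0.16), z_at(0.84)

# Toy map: Gaussian in z centred on 0.5 with sigma 0.1, flat in B/T.
z_grid = [0.01 * i for i in range(101)]
toy_map = [[math.exp(-0.5 * ((z - 0.5) / 0.1) ** 2)] * 3 for z in z_grid]
z_best, z_min, z_max = redshift_estimate(z_grid, toy_map)
```

For a Gaussian map the 16 and 84 per cent points fall close to one standard
deviation either side of the median, which is why this interval is described
as an effective $1\sigma$ range.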

These error estimates depend on the estimated photometric errors: through the
use of the covariance matrix $C$, the error estimates in the photometry are
explicitly included in the determination of $P_2(z, B/T)$. Large uncertainties
in the colours propagate through into larger uncertainties in the ($z$, B/T)
combinations capable of adequately reproducing the observed colours. It is
important to know what photometric accuracy is needed to keep the redshift
error within a given bound, as this will certainly constrain the accuracy of
any future applications of this method.

To produce redshift error estimates as a function of photometric accuracy,
simulations were undertaken using 100 galaxies with $K < 20$, chosen at random
in the ranges $0 < B/T < 1$ and $0.2 < z < 1.8$. The bulge metallicity
$\langle[M/H]\rangle_{\rm bulge}$ was chosen randomly between 0.061 and
$-0.523$. After allocating the model magnitude in each passband for each
galaxy, Gaussian photometric errors were applied. We used these magnitudes and
photometric errors of the simulated galaxies in the course of the redshift
estimation. Here the BVRIK passbands were used. The quantities
$\sigma(\Delta z)$ and $\sigma({\rm error})$, corresponding to the root mean
square (RMS) of the real and estimated redshift errors respectively, are
plotted in Fig. 6, where:

$$\Delta z = z_{\rm estimated} - z_{\rm real}, \qquad (14)$$

$${\rm error} = (z_{\rm estimated}^{\rm max} - z_{\rm estimated}^{\rm min})/2, \qquad (15)$$

$$\sigma(\Delta z) = \sqrt{\overline{\Delta z^{2}}}, \qquad (16)$$

$$\sigma({\rm error}) = \sqrt{\overline{{\rm error}^{2}}}. \qquad (17)$$

As seen from the solid lines, both $\sigma(\Delta z)$ and
$\sigma({\rm error})$ increase with photometric error. Importantly, even if the
photometric error is as bad as 0.15 magnitudes in all bands, the redshift error
$\sigma(\Delta z)$ is still kept smaller than 0.1. The estimated redshift error
$\sigma({\rm error})$ is roughly comparable to the photometric error.
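The two statistics of Eqs. (14) to (17) are simple to compute once the
estimates are in hand. The following sketch uses hypothetical function names
and four made-up galaxies purely to show the bookkeeping.

```python
import math

def rms(values):
    """Root mean square of a sequence of numbers."""
    return math.sqrt(sum(v * v for v in values) / len(values))

def redshift_statistics(z_est, z_min, z_max, z_real):
    """Return (sigma(Delta z), sigma(error)) following Eqs. (14)-(17)."""
    delta_z = [e - r for e, r in zip(z_est, z_real)]            # Eq. (14)
    error = [(hi - lo) / 2.0 for hi, lo in zip(z_max, z_min)]   # Eq. (15)
    return rms(delta_z), rms(error)                             # Eqs. (16), (17)

# Made-up values for four galaxies, for illustration only.
z_real = [0.40, 0.80, 1.00, 1.20]
z_est = [0.45, 0.75, 1.05, 1.15]
z_min = [0.35, 0.70, 0.95, 1.05]
z_max = [0.55, 0.80, 1.15, 1.25]
sigma_dz, sigma_err = redshift_statistics(z_est, z_min, z_max, z_real)
```

Comparing the two numbers shows whether the quoted confidence intervals are a
fair description of the actual scatter.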



Next we consider the effect of mis-estimation of the photometric error on the
redshift estimation error. It is possible that if the errors are under- or
over-estimated, the redshift estimator will make the distribution of likely
($z$, B/T) too broad or multiply-peaked, reducing the accuracy of the redshift
estimate. Therefore, it is important to test the effects that uncertainty in
the determination of the errors can have on the accuracy of the redshift
estimate. We simulate this situation by fixing the real photometric error at
0.071 mag and varying the estimated photometric error going into the covariance
matrix $C$. The result is shown by the dashed lines in Fig. 6. It is clear that
the under- or over-estimation of the photometric errors has little, if any,
effect on the quality of the redshift estimation $\sigma(\Delta z)$. Errors in
the determination of the photometric quality do, however, have a marked effect
on the estimated quality of the redshift determination, given by
$\sigma({\rm error})$. It is therefore important to estimate the photometric
errors carefully in order to assess the quality of the redshift determination
reliably.

The quality of the redshift estimation also depends sensitively on the
passband choice. We investigated three sets of passbands for the randomly
generated galaxies with 0.071 mag photometric errors, and the results are
summarised in Table 4. With the RIK passbands, the result is about a factor of
two worse than in the BVRIK case. This is because it gets harder to disentangle
the colour degeneracies between lower redshift early-types and higher redshift
late-types without using the bluer B and V bands. If, instead, we do not use
K-band colours, the quality of the redshift estimation worsens further, as high
redshift galaxies with $z > 0.8$-$1.0$ no longer have a passband longwards of
the 4000 Å break. It is therefore important to choose the passbands carefully
for photometric redshift estimation, according to the redshift ranges under
consideration and the depth of the photometric sample.



4 TESTING 

In this section, we focus on testing our method using photometry for galaxies
with known spectroscopic redshifts. Because we wish to focus on the recovery of
high redshift clusters at $z \simeq 1.0$, it would be best to test with an
extensive dataset for a real high redshift cluster. However, such data are not
available at the moment, since we need both multi-colour photometry covering
the 4000 Å break (at least 3-4 bands) and spectroscopically determined
redshifts for individual galaxies. Therefore, we have decided to test our
method with two independent sets of data: the well-studied cluster Abell 370 at
$z = 0.374$, and the Hubble Deep Field.
Figure 6. The effects of photometric error on the quality of redshift
estimation. Simulations with 100 galaxies using the B, V, R, I and K passbands
were used to assess the effects of changing photometric quality in magnitude in
all bands (solid lines). The RMS difference between the best estimate and the
real redshift ($\sigma(\Delta z)$) is plotted against the Gaussian photometric
error ($\sigma({\rm mag})$) in the left panel, while the RMS redshift error
($\sigma({\rm error})$) is plotted in the right panel. The effects of under- or
over-estimation of the photometric error were also assessed by keeping the real
photometric error fixed at 0.071 mag, and varying the estimated photometric
errors going into the covariance matrix $C$ (dashed lines).




Figure 7. Field galaxies in the Abell 370 cluster field. Estimated redshift
vs. spectroscopically determined redshift (left), and redshift error vs.
estimated B/T ratio (right). Filled circles indicate E/S0 galaxies, while open
circles indicate disk galaxies and those without morphological information.

Figure 8. Cluster members of Abell 370. Distribution of the estimated
redshifts (left) and redshift error vs. estimated B/T ratio (right). The filled
histogram and filled circles indicate E/S0 galaxies, while the open histogram
and open circles indicate disk galaxies and those without morphological
information. The dotted and dashed histograms show the real redshifts of all
types and E/S0's, respectively. The two dotted lines (right) show the region of
$|\Delta z| < 0.1$ where galaxies are taken to be cluster members.



4.1 Abell 370 

The photometric data are taken from Pickles & van der Kruit (1991). We use
their BVRI photometry, measured in a 7 arcsec aperture. For our use, we
selected 59 galaxies which have spectroscopically determined redshifts. The
redshifts are mainly taken from Pickles & van der Kruit (1991), supplemented by
Stanford, Eisenhardt, & Dickinson (1995), although the latter gives only
cluster memberships, to which we assigned the cluster mean redshift
$z = 0.374$. To separate the sample of E/S0 galaxies, we use galaxy morphology
as given by HST images (Stanford et al. 1995). As Abell 370 has a redshift
$z = 0.374$, the filter combination (BVRI) is expected to work to pick out
cluster members to some extent, as it brackets the 4000 Å break (§ 2.2).
However, the U-band is missing, which is important for discriminating galaxies
with lower B/T ratios at the cluster redshift from those with higher B/T ratios
at lower redshifts. Unfortunately, the photometric accuracy is also poor,
especially in the B and I bands (0.13-0.19 mag in B, 0.04-0.08 mag in V,
0.03-0.07 mag in R and 0.08-0.16 mag in I).

We estimated the photometric redshifts for our sample galaxies. The results
for field galaxies and cluster members are shown separately in Figs. 7 and 8,
respectively. The cluster members are defined as those which have spectroscopic
redshifts $0.374 - 0.02 < z < 0.374 + 0.02$. For the E/S0 galaxies, we can
estimate the redshifts very well, within $|\Delta z| < 0.1$, both for field
galaxies and cluster members. On the other hand, some disk galaxies have over-
or under-estimated redshifts. In Fig. 7, there is a galaxy with
$\Delta z > 0.4$. This is because the I-band photometry of this galaxy is very
poor, as indicated in the original table in Pickles & van der Kruit (1991); in
fact, if we use only the BVR bands for this galaxy, the estimated redshift
agrees with the real redshift at the $1.5\sigma$ level. The redshifts of the
other two field disk galaxies are slightly underestimated, by
$\Delta z \sim -0.1$. These galaxies have very blue colours in $B-V$ or $V-R$,
which suggests that they are either disk dominated galaxies or galaxies
strongly influenced by a recent star burst. Among the cluster members (Fig. 8)
too, there are some galaxies whose redshifts are underestimated by as much as
$\Delta z \simeq -0.15$. These galaxies tend to have low estimated B/T ratios
and are again degenerate with galaxies at lower redshift. The discrimination
between a blue cluster member and a slightly redder galaxy at lower redshift
can be difficult, especially when we lack a bluer band corresponding to the
far-UV region (2500-3000 Å).

Nevertheless, if we adopt the criterion of defining cluster members as
$|\Delta z| < 0.1$, we can pick out most of the cluster members with little
field contamination, as shown in Figs. 9 and 10. This is especially true for
early-type galaxies. As a result, the C-M relation of the E/S0 galaxies is well
recovered (Fig. 9). The solid line shows the real relation for the real cluster
members, while the dashed line shows the estimated relation for the estimated
cluster members. We used a bi-weight fitting method to calculate these C-M
relations (Beers, Flynn, & Gebhardt 1990). Both relations are nearly identical.
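The membership cut can be written as a short filter. This sketch assumes that
$\Delta z$ in the selection criterion means the difference between a galaxy's
photometric redshift and the cluster redshift; the function names and the five
example galaxies are hypothetical and are not taken from the Abell 370 data.

```python
def select_members(z_estimated, z_cluster, half_width=0.1):
    """Indices of galaxies whose photometric redshift lies near the cluster."""
    return [i for i, z in enumerate(z_estimated)
            if abs(z - z_cluster) < half_width]

def contamination(selected, is_real_member):
    """Fraction of selected galaxies that are not spectroscopic members."""
    if not selected:
        return 0.0
    return sum(not is_real_member[i] for i in selected) / len(selected)

# Illustrative photometric redshifts and spectroscopic membership flags.
z_est = [0.36, 0.41, 0.20, 0.45, 0.40]
is_member = [True, True, False, True, False]
picked = select_members(z_est, 0.374)
field_fraction = contamination(picked, is_member)
```

The same two numbers, completeness of the picked sample and its field
fraction, are what Figs. 9 and 10 summarise graphically.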

In summary, although the photometric accuracy is not ideal, we can still pick
out most of the cluster members in A370, especially early-type galaxies, purely
photometrically with our method. The field contamination is negligibly small.
The method has difficulty in recovering cluster members bluer than
$V - R = 0.6$, but this would be improved if U-band data were available.

Figure 9. The C-M diagram for Abell 370. The filled symbols indicate cluster
members, while the open symbols show field galaxies. The symbols surrounded by
a large circle indicate the estimated cluster members selected using
$|\Delta z| < 0.1$. Two error bars at the lower part of the figure indicate the
typical one sigma observational errors. The solid and dashed lines indicate the
C-M relation of the real cluster E/S0's and that of the estimated cluster
E/S0's, respectively, calculated using bi-weight fitting to the data.

Figure 10. The colour histogram for Abell 370. The solid line shows the real
cluster members, and the dotted line shows the real field galaxies. The
slantwise hatched area indicates recovered cluster members, while the black
shaded area indicates field contamination. Most of the cluster members are
recovered, although a small number of the bluest galaxies are dropped. This
would be improved if U-band data were available. Field contamination is also
negligibly small.



4.2 Hubble Deep Field 

To further test our method, we apply it to galaxies taken from the Hubble
Deep Field. The galaxies used here are chosen from Cowie's K-selected galaxy
sample (http://www.ifa.hawaii.edu/~cowie/k_table.html), all of which have
spectroscopic redshifts, mainly from Cohen et al. (1996). Isophotal magnitudes
in four HST WFPC2 filters (F300W [$U_{300}$], F450W [$B_{450}$], F606W
[$V_{606}$], and F814W [$I_{814}$]) are taken from Williams et al. (1996). We
cross-identify galaxies between Cowie's catalogue and the photometry catalogue
using RA and Dec. We choose only isolated galaxies to avoid mis-identification.
We also exclude galaxies with $z > 2$, as the current passbands no longer
bracket the 4000 Å break. Photometric errors are calculated from S/N ratios,
but for those less than 0.05 magnitude we assume a minimum error of 0.05
magnitude. We give a larger minimum error of 0.2 magnitude to $U_{300}$ for
galaxies with $z_{\rm estimated} > 0.2$, and to $B_{450}$ for galaxies with
$z_{\rm estimated} > 0.8$, in order to avoid incorporating uncertain model
far-UV colours. This is an iterative approach, but is unavoidable given the
uncertainty of the model UV spectrum. The infrared photometry in J is taken
from Cowie's table. A photometric accuracy of 0.1 magnitude is assumed in the
J-band as no information is given.
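The error floors described above can be expressed compactly. In this sketch
the conversion from S/N to magnitude error, $\sigma_m = 2.5/(\ln 10 \times S/N)$,
is the standard small-error approximation and is an assumption here, since the
text does not state which formula was used; the function name and band labels
are likewise chosen for illustration.

```python
import math

def mag_error(snr, band, z_estimated):
    """Photometric error (mag) with the minimum errors described in the text."""
    sigma = 2.5 / math.log(10.0) / snr      # assumed S/N-to-magnitude conversion
    floor = 0.05                            # general minimum error
    if band == "U300" and z_estimated > 0.2:
        floor = 0.2                         # band samples the uncertain far-UV
    elif band == "B450" and z_estimated > 0.8:
        floor = 0.2
    return max(sigma, floor)

# The floor depends on the current redshift estimate, hence the iteration:
# estimate z, update the floors, and re-estimate until z stops changing.
err_i = mag_error(100.0, "I814", 0.5)
err_u = mag_error(100.0, "U300", 0.5)
err_v = mag_error(5.0, "V606", 0.5)
```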



Applying the model directly to this data set, we find a systematic offset of
$\Delta z \sim -0.1$ between the real redshifts and the estimated redshifts.
The most likely cause of this discrepancy is a zero-point mismatch of order 0.1
magnitude between the data and the model, in the sense that the model is
slightly redder in optical colours and a bit bluer in far-UV colour. It might
be the intrinsic zero-point uncertainty in the model, since it is comparable to
the limitation of the current population synthesis models (Charlot, Worthey, &
Bressan 1996), although it is puzzling that a similar problem is not seen in
Abell 370. To correct this situation, we shift the model zero-points for this
case only: i.e. +0.1 magnitude is added to the model $U_{300}$, $I_{814}$, and
J. With this zero-point shift, most of the redshifts of the HDF galaxies are
correctly estimated, as shown in Fig. 11. The RMS errors of the estimated
redshift are smaller than 0.1; i.e. $\sigma(\Delta z) = 0.091$ and
$\sigma({\rm error}) = 0.075$. Although the zero-point mismatch is a problem,
it is encouraging that our method can estimate redshifts correctly over a wide
range of redshifts. However, this exercise makes it clear that the model should
be calibrated with real data rather than being applied blind to high redshift
systems. This can be achieved using a handful of spectroscopically confirmed
members of the target cluster without compromising the overall aim of examining
the star formation histories of the galaxy populations.
Figure 11. HDF galaxies. The $U_{300}$, $B_{450}$, $V_{606}$, $I_{814}$, and J
passbands are used. Filled symbols indicate morphologically classified E/S0
galaxies, while the rest show other type galaxies or unclassified galaxies.



Table 5. Simulated galaxies in a z = 1 cluster field.

            type     n     <[M/H]>_bulge    z_f          B/T
  cluster   E/S0     10    0.06 ~ -0.52     4.5          1.0
                     10    0.06 ~ -0.52     4.5 ~ 1.5    1.0
                     10    0.06 ~ -0.52     4.5          1.0 ~ 0.5
            Sp       20    0.06 ~ -0.52     4.5          0.5 ~ 0.0
  field     E         6    0.06 ~ -0.52     4.5          1.0 ~ 0.6
            S0       26    0.06 ~ -0.52     4.5          0.6 ~ 0.5
            Sab      17    0.06 ~ -0.52     4.5          0.5 ~ 0.15
            Sc-Im     5    0.06 ~ -0.52     4.5          0.15 ~ 0.05



5 APPLICATION TO HIGH REDSHIFT 
CLUSTERS 

We are interested in applying this classifier to high redshift clusters
around $z \simeq 1.0$, but, at present, a suitable data set is not available.
To show the applicability of the classifier to targets at that redshift, we
simulated a $z = 1$ cluster field using the model described in § 2. Although
this is a self-consistency check (most importantly, it assumes that the
photometric properties of real galaxies are accurately described by the stellar
population synthesis code), it allows us to estimate the biases present in the
recovered galaxy samples and to determine how much photometric accuracy and
which combination of passbands is required to pick out cluster members
effectively in such a cluster.

We generated field galaxies using the type dependent prior distribution
outlined in § 3.2. The metallicity of the bulge was chosen so that
$M_V^{\rm bulge} = -23$, $-20$ and $-17$ measured at $z = 0$ correspond to
$\langle[M/H]\rangle_{\rm bulge} = 0.06$, $-0.23$, and $-0.52$. Here we have
simulated a K-limited galaxy sample with $m_K < 20$ for a 1 arcmin$^2$ field of
view, which corresponds to 0.5 Mpc × 0.5 Mpc at $z = 1$, using the prior
distribution and taking the type-dependent LF into account. The number of
galaxies of each type is summarised in the lower half of Table 5. As for the
cluster members, we assumed the mix of galaxy populations given in the upper
half of Table 5; i.e. 10 E/S0's from a metallicity sequence, another 10 E/S0's
from an age sequence, a further 10 E/S0's from a B/T sequence, and finally 20
disk galaxies. Firstly, by using the K-band luminosity functions of high
redshift cluster galaxies (mean redshift 0.43), which are given separately for
E/S0's and spirals (Barger et al. 1998), we assigned a K-band absolute
magnitude at $z = 0.43$ to a given galaxy. Secondly, a formation epoch ($z_f$)
and a B/T ratio are randomly assigned in the respective ranges given in
Table 5. Then we can assign its bulge metallicity using its $M_V^{\rm bulge}$
at $z = 0$ calculated from $M_K$ at $z = 0.43$. If a galaxy has $m_K > 20$ at
$z = 1$, it is rejected from our sample, and the
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Table 6. The bi-weight scatters and slopes of the C-M relation, 
the values for all types and E/SO galaxies axe shown separately. 
The scatters are measured with respect to the C-M relation of 
E/SO galaxies only. 







Abell 370 


z = I cluster 






all 


E/SO 


all 


E/SO 


scatter 


real 


0.125 


0.073 


0.613 


0.169 




estimated 


0.112 


0.072 


0.603 


0.189 


slope 


real 


-0.018 


-0.008 


-0.423 


-0.165 




(*sl iiiialcd 


-0,018 


-O.OK) 


-0.:-!81 


-0.165 



process is repeated until we finally obtain 50 cluster galax- 
ies in total. We assigned the model magnitudes in various 
bands for each galaxy both in the field and in the cluster. A 

Gaussian photometric error with a = 0.071 is added on each 
colour of each galaxy. We then regard these generated pho- 
tometric data as the observational ones for the 2 = 1 cluster 
field, and the redshift classifier is applied to each galaxy to 
estimate a redshift. 
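The construction of the mock cluster sample described above (draw a galaxy, reject it if it is fainter than the K-band limit, repeat until 50 members remain, then perturb each colour) can be illustrated with a minimal sketch. The function draw_model_galaxy is a hypothetical stand-in for the population synthesis model, and its magnitude and colour ranges are arbitrary placeholders rather than values from this paper; only the limit m_K < 20, the sample size of 50 and σ = 0.071 are taken from the text.

```python
import random

SIGMA_COLOUR = 0.071  # Gaussian error per colour, as quoted in the text
K_LIMIT = 20.0        # apparent K-band magnitude limit, as quoted in the text
N_CLUSTER = 50        # number of cluster members to generate


def draw_model_galaxy(rng):
    """Hypothetical stand-in for the population synthesis model.

    Returns (m_K, [V-R, R-I, I-K]) without photometric noise. The numerical
    ranges are arbitrary placeholders, not values taken from the paper.
    """
    m_k = rng.uniform(16.0, 22.0)
    colours = [rng.uniform(0.5, 1.5), rng.uniform(0.5, 1.5), rng.uniform(1.5, 3.5)]
    return m_k, colours


def build_mock_cluster(rng, n_target=N_CLUSTER):
    """Draw galaxies until n_target of them pass the K-band limit."""
    sample = []
    while len(sample) < n_target:
        m_k, true_colours = draw_model_galaxy(rng)
        if m_k > K_LIMIT:
            continue  # fainter than the limit: reject and draw again
        # Independent Gaussian error on each colour of the accepted galaxy.
        observed = [c + rng.gauss(0.0, SIGMA_COLOUR) for c in true_colours]
        sample.append({"m_K": m_k, "true": true_colours, "observed": observed})
    return sample


mock = build_mock_cluster(random.Random(1))
```

The noisy colours in mock would then play the role of the observed photometry that is passed to the redshift classifier.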



5.1 Biases in the recovered galaxy properties

The results for the field galaxies and the cluster members
are shown separately in Figs. 12 and 13. We used VRIK
colours to estimate the redshifts. The B-band was not used, since
at z = 1 it falls in the far-UV spectral region, well below
2500 Å. The overall agreement between the estimated
redshift and the real redshift is excellent. Most of the galaxy
redshifts are recovered to within |Δz| = 0.1, regardless of
real redshift and irrespective of galaxy type. As a result, as
shown in Fig. 14, the recovery of cluster members is
excellent. Here we adopted |Δz| < 0.1 as the criterion for
cluster membership, considering the photometric accuracy (see
Fig. 6). This criterion is also adequate since it does not pick
up a large amount of field contamination at high redshifts. In
this case, only one cluster spiral is dropped. Field contamination
is also negligible (only four galaxies). We have recovered not
only old ellipticals, but also young or star forming ellipticals
and spirals. To see the bias in the identification of
cluster members as a function of galaxy colour, we show the
colour histogram of the recovered cluster members and the
field contamination in Fig. 15. As is clear from the figure,
there is no apparent colour bias in either the cluster or the field.
Consequently, we recover the C-M relation of E/S0 galaxies
very well (identically in this case), as shown by the solid line
in Fig. 14. The bi-weight scatters around the relation are
also calculated and given in Table 6, together with the values of
the C-M slope. The numbers for Abell 370 are also given in
the same table. Both scatters and slopes are estimated almost
correctly, irrespective of galaxy type.
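The membership bookkeeping used above can be sketched as follows. This is an illustration under stated assumptions: we take Δz to be the difference between the estimated redshift and the cluster redshift, the function name membership_stats and the six example galaxies are hypothetical, and the two fractions follow the definitions given in the caption of Table 7 (dropped members per real cluster member, field contamination per estimated cluster member).

```python
def membership_stats(z_est, is_member, z_cluster=1.0, dz_max=0.1):
    """Count dropped cluster members and field contaminants.

    A galaxy is selected as a member when |z_est - z_cluster| < dz_max
    (assumed reading of the |dz| < 0.1 criterion in the text).
    """
    selected = [abs(z - z_cluster) < dz_max for z in z_est]
    n_real = sum(is_member)       # true cluster members
    n_selected = sum(selected)    # galaxies classified as members
    dropped = sum(m and not s for m, s in zip(is_member, selected))
    contamination = sum(s and not m for m, s in zip(is_member, selected))
    return {
        "dropped": dropped,
        "dropped_frac": dropped / n_real if n_real else float("nan"),
        "contamination": contamination,
        "contamination_frac": (
            contamination / n_selected if n_selected else float("nan")
        ),
    }


# Hypothetical toy input: three true members followed by three field galaxies.
stats = membership_stats(
    [1.02, 0.95, 1.30, 0.40, 1.05, 0.93],
    [True, True, True, False, False, False],
)
```

In this toy input one true member (estimated at z = 1.30) is dropped and two field galaxies fall inside the selection window.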

All the above results are encouraging. If the models were
perfect, we could assign redshifts to an accuracy of 0.1 or
better given photometric errors of < 0.1 mag in all bands. With
this success, we will be able to extend the C-M relation analysis
(e.g., Kodama et al. 1998; Ellis et al. 1997; Stanford,
Eisenhardt, & Dickinson 1998) to z ~ 1 clusters without
obtaining spectroscopic redshifts. Importantly, we can pick out
cluster galaxies with various stellar populations; i.e. not only
passively evolving old galaxies but also galaxies which have
a significant contribution from younger stellar populations.
This is encouraging, as it is important to select the cluster
members with as little bias as possible. Our method will
allow us to conduct the colour scatter analysis around the
C-M relation reliably, and to look into the age dispersion of
cluster galaxies at high redshifts, if any.

Figure 14. The C-M diagram for a simulated cluster field at
z = 1. The caption is the same as for Fig. 9. Note that the real
C-M relation and the estimated one are identical (solid line).

[Figure 15 plot; horizontal axis: R - K]

Figure 15. The colour histogram of a simulated cluster field at
z = 1. The solid line shows the real cluster members, and the
dotted line shows the real field galaxies. The slantwise hatched
area indicates recovered cluster members, while the black shaded
area indicates field contamination. There are few dropped
members and little field contamination, and importantly they
have no bias in colours.

Figure 12. Field galaxies in the simulated cluster field at z = 1. Plotted are the estimated redshift vs. input real redshift (left), and
redshift error vs. input real B/T ratio (right). Redshifts are estimated using VRIK passbands with random Gaussian photometric errors
of σ = 0.071 magnitude in all bands. Filled circles indicate E/S0 galaxies, defined as B/T > 0.5, while open circles indicate disk galaxies
with B/T < 0.5.

Figure 13. Cluster members in the simulated cluster field at z = 1. Distribution of the estimated redshifts (left) and redshift error vs.
input real B/T ratio (right). The lines and the symbols are the same as in Fig. 8.
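As an illustration of the colour scatter analysis around the C-M relation, the sketch below fits a straight line to the E/S0 galaxies only and measures the scatter of all galaxies, and of the E/S0s alone, about that line. Two points are assumptions rather than statements of our procedure: the line is fitted by ordinary least squares, and the bi-weight scatter is computed with the standard biweight scale estimator (in the form given by Beers, Flynn & Gebhardt 1990, with tuning constant c = 9). The synthetic magnitudes and colours are placeholders.

```python
import math
import random
import statistics


def fit_line(x, y):
    """Ordinary least-squares fit y = slope * x + intercept."""
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def biweight_scale(values, c=9.0):
    """Biweight scale: a robust analogue of the standard deviation."""
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])
    if mad == 0.0:
        return 0.0
    num = den = 0.0
    for v in values:
        u = (v - med) / (c * mad)
        if abs(u) < 1.0:  # points beyond c * MAD get zero weight
            num += (v - med) ** 2 * (1.0 - u * u) ** 4
            den += (1.0 - u * u) * (1.0 - 5.0 * u * u)
    return math.sqrt(len(values) * num) / abs(den)


def cm_relation(mag, colour, is_early):
    """Fit the C-M relation to early types; return slope and two scatters."""
    mag_e = [m for m, e in zip(mag, is_early) if e]
    col_e = [c for c, e in zip(colour, is_early) if e]
    slope, intercept = fit_line(mag_e, col_e)
    resid_all = [c - (slope * m + intercept) for m, c in zip(mag, colour)]
    resid_e = [r for r, e in zip(resid_all, is_early) if e]
    return slope, biweight_scale(resid_all), biweight_scale(resid_e)


# Synthetic placeholder data: tight early types, broader late types.
rng = random.Random(0)
mag = [rng.uniform(17.0, 20.0) for _ in range(2000)]
is_early = [i % 2 == 0 for i in range(2000)]
colour = [
    5.0 - 0.1 * (m - 18.0) + rng.gauss(0.0, 0.05 if e else 0.3)
    for m, e in zip(mag, is_early)
]
slope, scatter_all, scatter_early = cm_relation(mag, colour, is_early)
```

Measuring both scatters against the E/S0 line mirrors the convention of Table 6, where the scatters for all types are quoted with respect to the C-M relation of E/S0 galaxies only.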

5.2 Optimal passbands for cluster identification 

In Table 7, we summarise the effect of passband choice on the
estimated redshift error. Photometric errors of 0.071 mag are
assumed in all passbands. Two cases in particular are found
to be ideal: BVRIK and VRIK. The redshift errors are
always well below 0.1 regardless of galaxy type, and hence
the number of galaxies dropped from our cluster sample,
and the field contamination, are minimised. It is
encouraging that we can do comparably well without the B-band, as
it is good to minimise the use of passbands shortwards of
rest frame 2500 Å where possible. It should be noted that
VIK and VRK work comparably well, although σ(error) is
larger, especially for galaxies with ongoing star formation.
This is because it becomes difficult to separate the effects of
changes in stellar population from those of changing redshift.
If both the B and V-bands are missing (and we have RIK only),
we tend to underestimate the redshift of bluer galaxies. This
is analogous to lacking the U-band for Abell 370: it becomes
difficult to disentangle galaxy type and redshift without the UV
colours for star forming galaxies. For the early type
galaxies, however, the redshift errors are reasonable, as we would
expect from Fig. 2. In contrast, if the K band is missing, the
errors in the redshift estimation become much larger,
irrespective of galaxy type, as there is no passband longwards
of the rest-frame 4000 Å break. It is therefore crucial
for high redshift work to have both optical and near infrared
passbands for accurate redshift estimation.
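The wavelength argument behind these passband choices can be checked with a short calculation of the rest-frame wavelength sampled by each band, λ_rest = λ_obs / (1 + z). The effective wavelengths below are approximate nominal values assumed for illustration, not numbers taken from this paper, and rest_frame_coverage is a hypothetical helper.

```python
# Approximate effective wavelengths in Angstroms (assumed nominal values).
BAND_WAVELENGTH = {
    "B": 4400.0,
    "V": 5500.0,
    "R": 6400.0,
    "I": 7900.0,
    "K": 22000.0,
}


def rest_frame_coverage(bands, z, uv_limit=2500.0, break_wavelength=4000.0):
    """Rest-frame wavelengths of a passband set observed at redshift z."""
    rest = {b: BAND_WAVELENGTH[b] / (1.0 + z) for b in bands}
    return {
        "rest": rest,
        # Bands sampling the uncertain region shortwards of uv_limit.
        "below_uv_limit": [b for b in bands if rest[b] < uv_limit],
        # True if the set has bands on both sides of the 4000 A break.
        "brackets_break": (
            min(rest.values()) < break_wavelength < max(rest.values())
        ),
    }


for combo in ("BVRIK", "VRIK", "RIK", "BVRI"):
    info = rest_frame_coverage(combo, z=1.0)
    print(combo, info["below_uv_limit"], info["brackets_break"])
```

With these assumed wavelengths, only B falls below rest-frame 2500 Å at z = 1, and any set without K has no band longwards of the 4000 Å break, in line with the behaviour described above.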



6 SUMMARY 

We present a new photometric redshift estimator, which is 
optimised for the identification and study of galaxy clusters 
at high redshifts. We use only a few broad passbands covering
the 4000 Å break, and find in practice that it is possible
to avoid the use of the uncertain colours shortwards of rest
frame 2500 Å. In our models, we considered as wide a variety
of stellar populations as is possible to minimise the selection 
bias in the recovered cluster members. As most of the effects 
of changing stellar population on the integrated colours are 
highly degenerate, we find that it is possible to estimate red- 
shifts with reasonable accuracy for a range of galaxy types, 
ranging from those with old, passively evolving stellar pop- 
ulations, through to those with younger stellar populations 
and on-going star formation. 

Following the success in testing our method with data
from Abell 370 and from the Hubble Deep Field, we applied
it to a simulated cluster at z = 1. We have shown that
redshifts can be estimated to an accuracy of |Δz| < 0.1
with multi-passband photometry of moderate quality
(≲ 0.1 mag) in a small number of passbands,
and that the cluster members can be reliably identified.
Therefore, the recovery of the C-M relation, in terms of both
slope and scatter, is expected to be accurate and almost free
from any selection bias. We now have a means of analysing
the photometric properties of cluster galaxies at very high
redshifts without a thorough spectroscopic membership
confirmation.
Table 7. Passband choice for a simulated cluster field at z = 1. Numbers of dropped members and field contamination are also
presented. The percentages of dropped members and of field contamination are defined per real cluster member and per estimated cluster
member, respectively.
E/S0    all
3 (6%)    3 (10%)    5 (10%)    4 (13%)
VRIK    0.079    0.099    0.064    0.095    1 (2)